Picture for Jianbing Shen

Jianbing Shen

CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition

Add code
May 19, 2026
Viaarxiv icon

OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

Add code
Apr 24, 2026
Viaarxiv icon

Multimodal Large Language Models for Multi-Subject In-Context Image Generation

Add code
Apr 08, 2026
Viaarxiv icon

Accelerating Training of Autoregressive Video Generation Models via Local Optimization with Representation Continuity

Add code
Apr 08, 2026
Viaarxiv icon

Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs

Add code
Mar 21, 2026
Viaarxiv icon

Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation

Add code
Mar 16, 2026
Viaarxiv icon

HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation

Add code
Mar 11, 2026
Viaarxiv icon

Condition Errors Refinement in Autoregressive Image Generation with Diffusion Loss

Add code
Feb 02, 2026
Viaarxiv icon

Towards Geometry-Aware and Motion-Guided Video Human Mesh Recovery

Add code
Jan 29, 2026
Viaarxiv icon

From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving

Add code
Dec 13, 2025
Viaarxiv icon